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A theoretical approach is developed to quantify hydrophobic hydration and interactions on a molecular scale, 
with the goal of insight into the molecular origins of hydrophobic effects. The model is based on the fun- 
damental relation between the probability for cavity formation in bulk water resulting from molecular-scale 
density fluctuations, and the hydration free energy of the simplest hydrophobic solutes, hard particles. This 
probability is estimated using an information theory (IT) approach, incorporating experimentally available 
properties of bulk water - the density and radial distribution function. The IT approach reproduces the 
simplest hydrophobic effects: hydration of spherical nonpolar solutes, the potential of mean force (PMF) 
between methane molecules, and solvent contributions to the torsional equilibrium of butane. Applications 
of this approach to study temperature and pressure effects provide new insights into the thermodynamics 
and kinetics of protein folding. The IT model relates the hydrophobic-entropy convergence observed in pro- 
tein unfolding experiments to the macroscopic isothermal compressibility of water. A novel explanation for 
pressure denaturation of proteins follows from an analysis of the pressure stability of hydrophobic aggre- 
gates, suggesting that water penetrates the hydrophobic core of proteins at high pressures. This resolves 
a long-standing puzzle, whether pressure denaturation contradicts the hydrophobic-core model of protein 
stability. Finally, issues of "dewetting" of molecularly large nonpolar solutes are discussed in the context of 
a recently developed perturbation theory approach. 



I. Introduction 

Hydrophobic interactions play a central role in many 
self-assembly processes in aqueous solution: protein fold- 
ing; the formation of micelles, membranes, and com- 
plex mesophases in surfactant solutions; the formation 
of macromolecular complexes; and the non-specific ag- 
gregation of proteins in inclusion bodies during over- 
expression or in disease-causing amyloid plaque forma- 
tion in brain tissue. 1 ' 2,3 ' 4,5 Biopolymers such as pro- 
teins, nucleic acids, and lipids contain a significant frac- 
tion of nonpolar groups: aliphatic and aromatic amino- 
acid side chains, the faces of nucleic-acid bases, and 
the hydrocarbon tails of lipids. Attractive hydropho- 
bic interactions between these nonpolar groups con- 
tribute significantly to the stability of folded proteins, 
base-stacking in helical nucleic acids, and membrane bi- 
layers. Protein folding may be the most extensively 
characterized macromolecular self-assembly process in 
aqueous solution. Thermodynamic 1 ' 2,6,7,8 and structural 
studies 9,10 on proteins have shown that the formation of 
a hydrophobic core, comprising predominantly nonpolar 
amino acids, plays a significant if not the dominating 
role in protein folding. Clearly, other interactions, such 
as electrostatic interactions and intramolecular or water- 
mediated hydrogen bonding, cannot be neglected in a full 
treatment of the thermodynamic stability of amphiphilic 
molecules and their assemblies. Nevertheless, a quan- 
titative understanding of the stability of hydrated am- 
phiphilic macromolecules necessitates a thorough, quan- 
titative description of hydrophobic driving forces as a 
major contributing factor. 

Despite many decades of research on hydrophobic ef- 



fects, our understanding of key phenomena presumably 
of hydrophobic origin is still incomplete, and seemingly 
contradictory observations have yet to be explained. Sur- 
prisingly, these include the effects of changing even the 
most elementary thermodynamic variables, temperature 
and pressure, or key solution properties, such as salt con- 
centration and composition. The well-known fact that 
proteins denature at elevated temperatures and at el- 
evated pressures is an immediate example. Heat de- 
naturation can be explained by the hydrophobic core 
model of proteins, which was established on the ba- 
sis of extensive temperature unfolding studies. 6,7,8,11,12 
These studies show that the thermodynamic character- 
istics of heat denaturation are similar to those of trans- 
ferring nonpolar molecules from a hydrocarbon or gas 
phase into water. This suggests that nonpolar residues 
in the protein interior become exposed to solvent dur- 
ing denaturation. However, as pointed out poignantly 
by Kauzmann, 13 pressure denaturation appears to con- 
tradict the hydrophobic-core model of proteins in the 
sense that the hydrocarbon transfer process and protein 
denaturation exhibit diametrically opposite pressure de- 
pendences of the associated volume change. 

One goal of our work has been to reconcile these appar- 
ently contradictory results for temperature and pressure 
denaturation of proteins within the context of the hy- 
drophobic core model of protein folding. To achieve this, 
we developed detailed molecular explanations for the un- 
derlying processes. 14,15 Here, we will focus specifically on 
the observed temperature convergence of hydrocarbon- 
transfer entropies from calorimetry experiments. 1 That 
entropy convergence coincides with the observed tem- 
perature convergence of the entropy of protein unfolding 
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and thus provides a thermodynamic foundation for the 
hydrophobic-core model of protein folding. 6 ' 7,11,12 ' 16 ' 17 ' 18 
In addition, we will suggest a molecular mechanism 
for pressure denaturation of p ro teins 19,20,21,22,23 ' 24 ' 25 ' 26 
and discuss its consequences on folding kinetics and 
the characteristics of the ensemble of unfolded protein 
structures. 15 Our explanation of pressure denaturation 
invokes experimental observations regarding the differ- 
ences in the structures of heat and pressure-denatured 
proteins, the latter being more compact. 23,27 ' 28 We will 
also give preliminary results for the effects of salt con- 
centration on hydrophobic hydration, as characterized by 
the Hofmeister series, 29 which ranks salts by their abil- 
ity to increase ("salt in") or decrease ("salt out") the 
solubility of nonpolar molecules in water. 

To achieve these goals, we need a model that quanti- 
fies hydrophobic hydration and hydrophobic interactions, 
and the effects of temperature, pressure, and salt concen- 
tration on them. A number of models of the hydrophobic 
effect have been developed in the past, 1,2,4,30,31,32,33,34 
all having their merits and limitations. 35 One of the 
most influential models is scaled-particle theory (SPT) 36 
and its extension to hydrophobic hydration. 37,38 SPT 
laid a molecular foundation for surface area mod- 
els of hydrophobic hydration with macroscopic surface 
tensions. 39,40,41,42 ' 43 ' 44 ' 45 ' 46 However, this invocation of 
macroscopic parameters limits applications of SPT to 
study the subtle changes due to temperature and pres- 
sure, for example, that are closely associated with the 
molecule-scale properties. A thoroughly molecular the- 
ory was developed by Pratt and Chandler (PC), 47 based 
on the theory of simple liquids. The PC theory invokes 
radial correlation functions in the context of Ornstein- 
Zernike integral equations with appropriate closures. 
Lazaridis and Paulaitis 48 (LP) use an expansion of the 
conformational entropy in terms of particle distribution 
functions. Both PC and LP theories relate the structural 
ordering of water by the nonpolar solute to the hydration 
thermodynamics through solute- water pair correlations. 

Probably the most powerful approach is the direct 
computer simulation of hydrophobic hydration phenom- 
ena using molecular dynamics or Monte Carlo methods. 49 
Computer simulations arc flexible and, with the advent 
of ever faster computers, the system size and time scale 
limitations are becoming less of a concern. However, the 
interpretation of computer simulation results necessitates 
a well-founded theoretical framework. Here, and in the 
aforementioned theories, computer simulations provide 
critical input that is not readily available from experi- 
ments. 

The theoretical approach pursued here describes hy- 
drophobic hydration and hydrophobic interactions using 
chemical potentials of the simplest hydrophobic solutes 
in water - "hard" molecules exerting entirely repulsive 
interactions on water molecules. The solvation chemical 
potentials for hard-core solutes are directly related to the 
presence of cavity volumes of molecular size due to den- 
sity fluctuations in bulk water. We quantify these density 
fluctuations using an approach motivated by informa- 
tion theory (IT). 50 From a maximum-entropy principle, 51 



we find the "best-possible" description of the fluctua- 
tions that satisfies certain experimental constraints. The 
virtues of such an approach are its simplicity, efficiency, 
and accuracy for molecule-size solutes. In addition, the 
IT model expresses the simple hydrophobic phenomena 
using properties of bulk water alone. This allows us to 
relate hydrophobic effects to the peculiar properties of 
water that distinguishes it from nonpolar solvents. The 
IT model builds on concepts introduced in SPT, specif- 
ically the relation between the probability of finding a 
molecule-size cavity in water and the chemical potential 
of solvation of hard particles. In its simplest form, the IT 
model is also related to Ornstein-Zcrnikc integral equa- 
tion theories and the PC theory 50,52 ' 53 ' 54 

The paper is organized as follows: After developing 
the IT model for hard nonpolar solutes and a general- 
ization to continuous solute-solvent interactions, we will 
discuss practical aspects of the implementation. Limi- 
tations in the macroscopic limit will be analyzed before 
we view the IT model from a historical perspective. We 
will then present results for the simplest hydrophobic ef- 
fects - solvation of spherical solutes, methane-methane 
interaction in water, and contributions to the torsional 
equilibrium of butane. The temperature dependence of 
hydrophobic hydration and the entropy convergence will 
be studied subsequently. We will then quantify the ef- 
fects of pressure on hydrophobic interactions, leading to 
a model for the pressure denaturation of proteins. The 
effects of salt on hydrophobic hydration will be discussed 
briefly. We will also introduce a recently developed per- 
turbation theory model 55 based on the energetic loss of 
water molecules at the solute- water interface. This model 
allows us to extend our analysis of hydrophobic effects 
further to mesoscopic and macroscopic solutes. We will 
conclude with an attempt to give answers to the question 
how water differs from hydrocarbon liquids as a solvent 
for nonpolar solutes. 

II. Information theory model of hydrophobic hy- 
dration 

a. Simple hydrophobic effects: thermodynamics 
of dissolving a hard particle in water 

The simplest hydrophobic solute is a cavity or, equiva- 
lently, a hard-core particle which excludes water-oxygen 
atoms from a volume v of a given shape and molecular 
size. Cavity formation constitutes the important, first 
step in solvating any nonpolar solute in water. The sol- 
vation thermodynamics of a hard particle is determined 
by the excess chemical potential yu cx corresponding to 
the free energy of its transfer from an ideal gas into the 
aqueous environment. Statistical mechanics relates this 
excess chemical potential to the probability po of finding 
an empty volume v in water, i.e., a cavity of a given size 
and shape, 36 ' 56 

^ = -k B Tlnp . (1) 

To determine the chemical potential ^i cx , we need to 
quantify the probability p of successfully inserting a 
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hard-core solute of a given size and shape into equilib- 
rium conformations of water, as illustrated in Figure 1. 
A virtue of such an approach is that the solvation ther- 
modynamics characterized by /i ox is determined by the 
properties of pure water, with the solute entering through 
its molecular volume v. 
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Figure 1. Schematic two-dimensional representation of 
the probabilities p n of observing n solvent centers inside 
a randomly positioned volume v in the bulk fluid. Two 
cases are shown: A successful insertion with n = that 
would contribute to po, and an insertion with n = 3 
contributing to p 3 . 

b. Information theory approach to excess chemi- 
cal potentials of solvation 

The goal is to estimate accurately the probability po 
that a given hard-core solute inserted into water docs 
not overlap with any of the solvent centers, defined as 
the positions of water-oxygen atoms. The IT approach 
does not attempt to model this quantity directly. In- 
stead, it focuses on the set of probabilities p n of find- 
ing n water-oxygen atoms inside the observation volume, 
with p being just one of the p n . We will attempt to get 
accurate estimates of the p n , and po in particular, us- 
ing experimentally available information as constraints 
on the p n . The moments of the fluctuations in the num- 
ber of solvent centers within the observation volume v 
provide such constraints. 

For a given observation volume v in bulk water, the 
moments of the fluctuations in the particle number n are 
determined from the p n as follows: 

oo 

<n fe ) = ]>>„n\ (2) 

n=0 

where (• • •) denotes a thermal average. The zeroth, first, 
and second moment can be expressed in terms of exper- 
imentally accessible quantities: 

oo 

(l) = J2Pn = l (3a) 

oo 

( n ) = ^2 p« n = p v > ( 3b ) 

n=0 



(n 2 ) = f>„n 2 = (n) + p 2 [ dr f dv' g{\v - r'|) , (3c) 

where p is the number density of bulk water and g(r) 
is the radial distribution function between water-oxygen 
atoms in bulk water, which can be determined from X- 
ray or neutron scattering measurements or computer sim- 
ulations. These moment conditions provide constraints 
on the PnS, and guarantee that the p n are normalized 
and have the correct first and second moments. Higher 
moments would require knowledge of triplet and higher- 
order correlation functions which are not generally ac- 
cessible to experiments, but can be calculated from com- 
puter simulations. 58 

The IT approach attempts to provide the "best- 
possible" estimate of the probabilities p n under the con- 
straints of the available information, 51 defined as the set 
{p n } that maximizes an information entropy rj subject to 
the information constraints, 

max v({Pn}) ■ (4) 

{constraints} 

In the most general form, we adopt a relative or cross 
entropy, 59 

OO , s 

V({Pn}) = -J2Pn^ - . (5) 
n=0 \ PnJ 

where p n represents an empirically chosen "default 
model." 

We consider two natural choices of default models: the 
Gibbs default model p n oc 1/nl, which leads to a Poisson 
distribution for a given mean, as would be expected for an 
ideal gas, and a flat distribution (p n = 1 for n < n ma x and 
p n = otherwise), which results in a discrete Gaussian 
form of p n with given mean and variance. Empirically, 
we find that the latter choice is accurately applicable to 
molecule-size cavities. 50 
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Figure 2. Probabilities p n for observing n water- 
oxygen atoms in spherical cavity volumes. 50 Results from 
Monte Carlo simulations of SPC water 57 are shown as 
symbols. The parabolas are the predictions of the flat 
default model of the IT approach. The center-to-center 
exclusion distance d (in nanometers) is noted next to the 
curves. 
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Maximizing the information entropy under the con- 
straints of eq g leads to 



n f. „Ao+Airi+A2ra 

Pn — Fn c 



(6) 



where Ao, Ai, and A2 arc the Lagrange multipliers chosen 
to satisfy the moment conditions eq |[ 

Figure 2 shows p n distributions for spherical observa- 
tion volumes v calculated from computer simulations of 
SPC 57 water. 50 The solute exclusion volume is defined by 
the distance d of closest approach of water-oxygen atoms 
to the center of the sphere. For the range of solute sizes 
studied, we find that lnp n values are closely parabolic in 
n. This would be predicted from the flat default model 
with n max — > 00, as shown in Figure 2. 
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Figure 3. Effect of increasing the number of mo- 
ments used in the IT calculations. Shown as a function 
of the number of moments is the predicted excess chem- 
ical potential of a methane-size hard sphere with exclu- 
sion radius d = 0.33 nm for solvation in SPC/E water 60 
(left-hand scale). Also shown for reference is the actual 
chemical potential (dot-dashed line). The Shannon in- 
formation I (right-hand scale) illustrates that the third 
and higher moments do not result in an appreciable gain 
in information. 

Figure 3 illustrates the effect of including higher mo- 
ments in the IT model. Results are shown both for the 
Gibbs and the flat default models. We find that the 
prediction of /i cx is greatly improved when the second 
moment (i.e., the variance of the particle number) is 
included in addition to the mean. However, inclusion 
of higher moments initially makes the prediction worse. 
Only when seven or more moments are used is the predic- 
tion as accurate as the two-moment model. Also shown in 
Figure 3 is the calculated Shannon information 7({p„}), 



the estimated values of po and /^ cx for the solute-size 
range considered in Figures 2 and 3. 

c. Continuous information theory 

A more general statement of the principles underly- 
ing the IT approach follows from a generalization to 
continuous interactions between the solvent and the hy- 
drophobic solute. Such a generalization can be achieved 
by explicitly considering solvent positions within a mi- 
croscopic volume, in addition to the occupancy num- 
bers considered in the basic IT approach. We de- 
fine probability distributions p(j;t"i, . . . , r,-) of observ- 
ing exactly j particles within the observation volume v 
at positions ri , . . . , Vj in infinitesimal volume elements 
dr\, . . . , dxj. The p(j; r±,...,Tj) allow us to calculate 
solvation chemical potentials of a solute with continu- 
ous interactions u(ri,r2, . . .) between solvent molecules 
at positions ri , r 2 , ■ ■ ■ and a solute at the origin. Within 
the observation volume v, the solute-solvent interactions 
are treated explicitly. Finite-range interactions provide a 
natural choice for i>; otherwise, a finite volume v can be 
chosen with corrections for long-range interactions. 

Widom's formula 62 relates the excess chemical poten- 
tial of the solute fi ex to the normalized p(j; ri, . . . , r^), 



E 



-/3«(ri,r 2 ,...) 



n / *< 



P(j') T U ■ ■ ■ , T j) e ' 



-/3u(n, 



(8) 



where = k^T. 

Calculating p(j';ri, . . . ,Tj) directly would require 
knowledge of higher-order correlation functions. Instead, 
our goal is to infer the p(J; ri, ... ,Tj) from available in- 
formation, such as the one- and two-particle densities in 
the observation volume v, expressed as constraint func- 
tional: 



(9a) 



5(r- ri)p(j;ri 



• r i) 



P V 2) (r, r')/2 = (J2 S(t - r a )S(r' r 7 )> 



(9b) 
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E 

J>2 



i=3 



dr 



6(r - n)S(r - r 2 )p(j;n, . . .,r 3 ) 



I({Pn}) 



E 



Pn hiPr, 



(7) 



Including the second moment results in a large gain of 
information, whereas the gains from including additional 
higher moments are small: moments of order three and 
higher are, in this respect, "un-informative," as measured 
by the gain in Shannon information I({p n }) (see also 
Ref. 61). Those higher moments do not significantly alter 



where r and r' are positions inside the observation vol- 
ume v. For a homogeneous fluid, the density will be uni- 
form, i.e., p(r) will be independent of position; and the 
two-particle density distribution will be the radial distri- 
bution function of the homogeneous system, g^(r, r') = 
9(\r-r'\). 

In analogy to eq 0, we define an information entropy 77 
for continuous solvent positions and discrete occupancy 
numbers 63 
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n / *< 



p(j;ri, . . .jTj^ln^jlpO'sri, . . .,rj)]. 

(10) 



We include the factor of in the logarithm of eq [To] for 
dimensional consistency. Maximizing this entropy func- 
tional under the constraints of eq ^| results in an expres- 
sion for the probability distribution p(J; r%, . . . , rj), 

3 

-/T 1 ln[w»i!p(j; n, • • • , rj)] = E w(1) ( r ^ 

fc=i 

+ i^/3^ 2 )(r fc ,r i ). (11) 

The Lagrange multipliers oj^^rj.) and a/ 2 -*^, r/) are 
chosen such that the p(j; n, . . . , r 3 -) satisfy the constraint 
functionals eq ^. Incorporating additional n-particle cor- 
relation information (n > 2) is straightforward. In a 
practical implementation, one can subdivide the obser- 
vation volume into a finite number of volume elements. 
The constraint functionals, eqs ^, then reduce to a finite 
number of constraints. 

Note that the probability distributions p(j; ri, . . . , r.,-) 
have a Boltzmann-Gibbs structure, i.e., they are propor- 
tional to an exponential of effective interactions ui^'fak) 
and Li/ 2 ) (rfc, rj) in k^T units, divided by j!. Summation 
and integration of the p{j\r\, ■ ■ ■ ,Tj) results in the fa- 
miliar form of a grand-canonical partition function for a 
microscopic volume v embedded in a large bath of sol- 
vent molecules. The effective interactions entering the 
Boltzmann-Gibbs factors in this grand-canonical parti- 
tion function are the Lagrange multipliers a/ 1 )^) and 
ui 



(2) 



(rfc,rj). These effective interactions are chosen to 
satisfy available information constraints, rather than be- 
ing derived directly from integrating out bath degrees of 
freedom. 

wW(r) is expected to be a spatial constant approxi- 
mately the negative chemical potential of the solvent. 55 
Similarly, a/ 2 ) (r, r') will be approximately an interatomic 
pair potential, except in a small region near the surface 
of the subsystem. 



d. Some practical aspects of the information the- 
ory model 

Here, we provide details on the practical implementa- 
tion the IT model. Before discussing specific details, we 
note that the required moments of particle-number fluc- 
tuations can be obtained in situ from simulation data. 
This idea is most direct for a uniform liquid. The ob- 
servation volume v is planted as a stencil in the simula- 
tion volume and the required moments are extracted as 
averages over a set of simulation configurations. For a 
uniform liquid the positioning of the observation volume 
is irrelevant and many positions can be used simultane- 
ously. It is worth noting further that since the occupation 
numbers can only be a finite number of non-negative inte- 
gers, only a finite number of binomial moments ((?)) will 



be non-zero. Similar procedures apply to non-uniform 
systems such as the solvation structure near a fixed so- 
lute, but the positioning of the observation volume is then 
relevant. 

The observation volume may be subdivided into sev- 
eral strata. This introduces cross moments in the general 
case and permits conceptually interesting possibilities for 
modeling the probabilities. As an example, consider the 
mean density in a volume element external to a core re- 
gion conditional on the requirement that the core region 
be empty. With such considerations, the IT modeling 
naturally produces a prediction of the hydration struc- 
ture near a hydrophobic solute without special geomet- 
rical limitations. Such possibilities have scarcely been 
studied so far, however. For the calculation of the re- 
quired moment information, stratification is no particular 
problem but it may affect the computational efficiency. 
Note, that a stratified representation can also be deriv ed 
from coarse-graining the continuous IT of section II c . 

Next, we will illustrate several methods to calculate 
the second moment of the particle-number fluctuations. 
Then we will describe numerical methods for the entropy 
maximization. 

Calculation of particle-number variances. The second- 
moment constraint eq |3c] requires the calculation of a 
double integral over the volume v involving the distance- 
dependent pair correlation function g{r). Such integrals 
can be evaluated using Monte Carlo techniques. The 
average of the pair correlation function, (g(\r — r'|)) can 
be determined by placing two points r and r' randomly 
inside the volume v according to a uniform probability 
density. This average is then multiplied by the square of 
the volume v to give an estimate of the integral in eq 

As suggested above, the variance and higher moments 
of particle-number fluctuations can be determined di- 
rectly from molecular simulations. Unlike po, the mean 
and variance can be calculated accurately from insertion 
also for large solutes. Objects with volume v (shape 
and size) are inserted with random positions and ori- 
entations into water configurations taken from an equi- 
librium molecular dynamics or Monte Carlo simulation. 
The mean and variance of the particle number n is then 
determined directly. 

Volumes of certain shapes are amenable to analytical 
evaluation. For spherical volumes v of radius d, Hill 64 de- 
rived a transformation of the six-dimensional integral to 
one-dimensional form. Such transformations can be ac- 
complished by determining the pair-distance distribution 
P(R) well known from small-angle scattering theory, 65 



P(R)= j dr j dr'S(R-\r-r'\ 



(12) 



where 5(x) is Dirac's delta function. This leads to the 
following expression for the second moment: 



(n 2 ) = (n) +p 2 J dR P(R)g(R) . (13) 
We can also use the three-dimensional analog to eq |l2|, 
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P(R) = I dr I dr'S[R - (r - r') 



(14) 



Fourier transformation relates P(R) to the form factor- 
s' (k) of the volume v, 



(15) 



P(k) =y drP(r)e lkr = |5(k)| 2 , 

where 

S{k) = [ dre ,;k r . 

J V 

The Fourier transform eq |l5| can be inverted as 
1 



P(R) 



(2tt) 3 



dkP(k)e~ 



(16) 



(17) 



As a generalization of Hill's result for a single sphere, 64 
we find for a collection of N non-overlapping spheres of 
radius d: 



P(P) = AttR 

N-l N 

+ E E 

i=l j=i+l 



x . , 5 , 4 P P 3 \ 

AttcP 1 - 

3 d 12d 3 7 



6{2d- R) 



nd a ( -(l-x i3 



3 / 2 - IP - n 



3rij Rd 



(18) 



y*j 



5 / 2 - ip 



60yPd 3 



?[P - (r,j - 2d)]6»(r ii +2d-R) 



where is the distance of the centers of spheres i and 
j, x l3 = (P 2 + rij 2 - 4d 2 )/(2Pr y ), and Vij = P 2 + r 2 - - 
IRrijXij. Using this method, analytical results can also 
be found for rotation ellipsoids with axes a = b and c. 

Maximizing the information entropy. Maximization 
of the information entropy eq || is easily performed us- 
ing Lagrange multipliers for the constraints. For con- 
straints on mean and variance, this leads to the expres- 
sion eq ^ for p n . Lagrange multipliers Ao, Aj., and A2 
are then calculated such that the corresponding p n sat- 
isfy the moment conditions eq [| This corresponds to 
solving three non-linear equations with three unknowns 
which can be accomplished, for instance, using a Newton- 
Raphson method. Alternatively, one can use standard 
minimization packages, minimizing the squared differ- 
ences of the left and right-hand sides of the moment 
constraint equations. This is consistent with the idea 
of minimizing the "thermodynamic potential" 



/(A 1 ,...,A fcmax ) = ln 



1 + E ^ exp ( ~ E Xk 



k=l 



3=1 
J 



k=l 



(19) 



here expressed in terms of the binomial moment infor- 
mation and for a general default model. 



e. Some pitfalls of the simplest models 

It is a helpful, heuristic view that the IT approach 
studies a model grand partition function associated with 
a molecular scale volume and involving effective interac- 
tions. Because this may be an unusual setting for the 
consequent statistical thermodynamic calculations, con- 
ventional results are not guaranteed. Here we note some 
of the pitfalls that have been encountered for the sim- 
plest models. These discussions emphasize that the ori- 
entation for development of this approach is to discover 
models that work. 

Non-separability of chemical potentials of distant sites 
within a Gaussian model. Consider two spheres of exclu- 
sion radius A separated by a distance large compared to A. 
The excess chemical potential of the two spheres should be 
two times that of the individual spheres, ^ cx (l, 2) = 2/j, ex . 
The mean and variance of the particle-number fluctua- 
tions are also additive. For the combined volume, we will 
have 1712 = 2mi and o~2 2 = 2tri 2 , where rrij and a 2 are 
the means and variances of the particle-number distribu- 
tions of an individual sphere (i = 1) and the two spheres 
(i = 2). Following the flat default model, we approximate 
Po by a Gaussian form, 14 



Po 



e -mi 2 /2tTi 2 
V(W) 



(20) 



We then find that the corresponding chemical potential 
of an infinitely-distant pair of spheres is not exactly that 
' of two individual spheres, 



M ex (l,2)-2 M ° 



ln(7rcrri 



(21) 



This non-additivity is a direct consequence of the Gaus- 
sian model and can be simply repaired by insisting ac- 
curately on zero probability for negative occupancies. In 
fact, the definition of our problem here requires that the 
occupancies be non-negative integers. 

Additivity can be recovered by dividing the two sphere 
volume into two strata: one for each sphere or, when the 
spheres overlap, the individual strata can be defined by 
utilizing the plane of intersection of the spherical surfaces 
as a bounding surface. Suppose again that the spheres 
are far apart and that the probability of occupancy of 
one sphere is independent of the occupancy of the other. 
The cross correlation between the numbers n\ and 712 of 
solvent centers in the two non-overlapping strata is then 
given by 

(nin 2 ) = p 2 I dri dr 2 g(\ri - r 2 |) 

= p 2 v 1 v 2 = (ni)(n 2 ) , (22) 

corresponding to uncorrelated particle-number fluctua- 
tions. An expanded IT model uses the joint probabilities 
p(n\, n-i) of finding n\ particles in Vi and n 2 particles in 
V2 ■ Including the constraint eq |2^ derived from stratifi- 
cation in that expanded IT model gives the correct re- 
sult that the hydration free energy of the two spheres is 



G 



the sum of the hydration free energies of the individual 
spheres. 

The non-additivity in the simple model is caused by 
the possibility of having a zero occupancy of the volume 
v with a negative number — n of particles in sub-volume 
V\ that is compensated by a positive number n of parti- 
cles in sub- volume v-z- By imposing explicitly that each 
of the sub-volumes has zero occupancy when the total 
occupancy is zero, the non-additivity is eliminated. This 
argument illustrates that the restriction of the probabil- 
ities p n to non-negative integers can be significant. This 
example also emphasizes that arguments well justified 
about the center of a distribution can lead to fundamen- 
tal errors when applied to the wings of a distribution. 

Macroscopic limit. For the flat default model, we can 
calculate the limit of large solutes. The variance in the 
particle number is then given by the isothermal compress- 
ibility XT, 



(n 2 ) - (n) 2 
¥) 



pk B TxT for v — > oo 



(23) 



For the flat default model and a spherical solute with 
exclusion radius A, we find that the chemical potential 
grows with the volume of the solute, 



dp 
dX 



2ttA 2 

XT 



(24) 



where we used the Gaussian approximation eq |2C|. The 
flat default model predicts for the water density pG(oo) 
at contact with a hard wall: 

G(oo) = lim (47T\ 2 pk B T)- 1 ^ « (2pk B Tx T r 1 , (25) 

A^oo OA 

which for water under standard conditions results in 
G(oo) ~ 8. The contact value for a flat wall, however, is 
well known to be G(oo) = pj pk B T, where p is the pres- 
sure. If we use bulk density and pressure of water under 
standard conditions, we obtain p/pk B T w 7.4 x 1(T 4 ; for 
saturated water vapor, 38 we find p/pk B T w 2 x 10~ 5 . 
Clearly, the contact value at a flat wall, G(oo), would be 
grossly overestimated if the flat default model were used 
in this macroscopic limit. 

In the limit of large solutes we expect the multipha- 
sic nature of water to become important. Stillinger 38 
pointed out that the solvation of hard spheres much 
larger than a water molecule can be quantified by con- 
sidering that a molecularly thick vapor layer surrounds 
the solute. For large solutes, the formation of a cor- 
respondingly large water-vapor bubble with subsequent 
insertion into that bubble will cost less free energy than 
inserting directly into the liquid phase. The simplest IT 
model uses a flat or Gibbs default model and does not 
account for this possibility, as density fluctuations are 
of either Gaussian or Poisson-like character, respectively. 
More elaborate default models should however be able to 
account for the possibility of vaporizing a certain small 
volume. But clearly, defining such models would require 
knowledge about the free energy of microscopic vapor- 
bubble formation which is a task as formidable as the 
calculation of solvation chemical potentials. 



An indication of the large-solute effects can already 
be seen in Figure 2. For solutes of size d = 0.3 nm 
and larger, we find that p\ is depressed relative to the 
flat default model. This means that it is relatively less 
likely to find an isolated water molecule in a cavity than 
would be predicted from that simple model. Based on 
the foregoing discussion, for solutes of increasing size we 
expect that po will be become larger than predicted from 
the flat default model. 

f. Historical perspective 

Scaled particle theory ideas. The IT approach can be 
viewed in the context of preceding studies of the insertion 
probability po. Mayer and Montroll 66 and later Reiss et 
al. 36 expressed pq in terms of higher-order correlation 
functions of the bulk fluid, 



Po = 1 



^ (_i)t 



m— 1 
,( m ) 



dv x dr 2 - - dr m p 



x 5 ^(n,r 2 ,...,r m ) , 



(26) 



where the m-body joint density for solvent 

centers (here, water-oxygen atoms). These are standard 
combinatorial results, more frequently seen in forms such 



67 



as 



and 



Po 



i + £(-i) 



m— 



j + m 
j 



n 

j + m 



(27) 



(28) 



where n is the number of solvent centers within v. The 
first of these can be derived directly by noting that 
po = ((—1 + 1)"} where (—1 + 1)" is an indicator func- 
tion that is one for n = and zero otherwise. These sums 
truncate sharply if only a finite maximum number of par- 
ticles can be present in the observation volume. They are 
of practical value for small volumes where the maximum 
number of solvent centers that can possibly occupy that 
volume is small. For larger volumes, lar ge t erms of alter- 
nating sign appear in the expansion eq |27j that make it 
difficult to use estimated values for the factorial moments 
in this linear fashion. 

Equating the excess chemical potential p cx to the 
quasi-static work of creating a cavity results in an ex- 
pression for po i n terms of the contact value G(A) of the 
solvent density at the surface of the exclusion volume v. 
For a spherical exclusion volume of radius A we have: 36 ' 68 



p cx = -k B Tln Po = 4npk B T / G(X)X d\ . (29) 

Jo 

This result is formally exact for a hard-sphere solute and 
establishes that 4TrpfcB2 n G(A)A 2 is the compressive force 
exerted by the solvent on a hard spherical solute of diam- 
eter A. If the solvent is considered to contain a hard core 
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itself, there is a maximum number of solvent particles 
Umax that can occupy a certain volume v. This results 
in useful, exact expressions for the case of n max = 1 and 
2. For larger solutes, the expected macroscopic solvation 
behavior 37,38 

p cx ~ p(47rA 3 /3) + 7 (4ttA 2 ) , (30) 

is used as a basis for extrapolation. Here p is the pressure 
and 7 is the liquid-vapor surface tension. In the inter- 
mediate molecular region, the approximate asymptotic 
form is connected smoothly to the exact curve for small 
solutes. 37 ' 38 

Computer simulation studies. The cavity formation 
probability po lends itself to direct computer simula- 
tion studies by test-particle insertion. Spherical vol- 
umes v in water were studied extensively using com- 
puter simulations. 69 ' 70 ' 71 ' 72 ' 73,74 Similar insertion meth- 
ods were also used to study polymer solubilities. 75 The 
simulation studies of Pohorille and Pratt 69 ' 70 ' 71 clarified 
a number of interesting speculations on hydrophobicity 
and provided the first discriminating tests of theoretical 
models for G(A). 




R(A) 

Figure 4. Distributions p nn of the distance R + Rs 
from a random point to the nearest neighboring site in 
water, n-hexane, and an ideal gas with water density. 71 
Rs is the van der Waals radius of the solvent. For each of 
the three fluids, the solvent van der Waals volumes are 
represented by a superposition of spherical sites of one 
type only. The ideal-gas curve is the Hertz distribution 
4irp(R + R s ) 2 exp[-4?rp(.R + i?s) 3 /3] with p the molec- 
ular (or oxygen) density of liquid water at 301 K and 
atmospheric pressure. The van der Waals radius of wa- 
ter oxygen is Rs — 1.35 A; that of united-atom carbons 
in n-hcxane is Rs — 1.85 A. 

One hypothesis that has sparked interest in recent 
years is the idea that the low solubility of inert gases 
in liquid water is due to the small size of the water 
molecule and, consequently, the "interstitial" cavities will 
be smaller in water than in coexisting organic liquids of 
common interest. 76 ' 77 Since the present work deals di- 
rectly with such cavities, it is straightforward to test that 
idea. Figure 4 shows distributions 71 of the distances from 
an arbitrary point to the nearest interaction site for three 
cases of interest: liquid n-hexane, liquid water, and an 



ideal gas with water density. Reasonable and customary 
values are used for the van der Waals radii of the solvent 
interaction sites in each case. Figure 4 shows that the 
most probable cavity radii are quite small in all those 
cases as is expected for liquids. Also, the difference in 
the most probable cavity size between liquid n-hexane 
and liquid water is not large, and smaller than the cor- 
responding difference between liquid water and an ideal 
gas with water density. The fact that the differences 
seen here are slight is, presumably, defined by the fact 
that the most basic units considered in n-hexane are the 
methyl and methylene groups. These are not so different 
in size from a water molecule. In contrast, simple equa- 
tion of state models might treat the n-hexane molecule as 
a sphere of substantially larger size. It can also be noted 
that on a packing fraction basis, typical organic liquids 
are denser than liquid water. 70 

One feature in Figure 4 that does distinguish water 
is the breadth of the distributions. The distribution of 
the nearest solvent site neighboring an arbitrary position 
in the liquid is narrower in liquid water. The widths of 
these distributions are independent of the van der Waals 
radii assigned to the solvent sites. The interpretation is 
that, compared to other liquids, water is "stiffer" when 
larger cavities must be opened. 
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Figure 5. Contact densities G(A) of water and n- 
hexane at the surface of hard-sphere solutes. 69 G(A) gives 
the compressive force of the solvent on a hard spherical 
solute. 

This comparative stiffness can be analyzed further by 
determining the contact functions G(A) for water and hy- 
drocarbon liquids and comparing the compressive force 
that such solvents exert on a hard spherical solute. As 
shown in Figure 5, water exerts a higher compressive 
force on such a solute than typical hydrocarbon sol- 
vents. It is thus an accurate view on molecular-scale hy- 
drophobic effects that water "squeezes-out" hydrophobic 
solutes. 78 ' 79 

Field theory models. A perspective related to the 
IT approach pursued here has been adopted before by 
Chandler. 52 Equilibrium fluctuations of the water density 
are approximated by a Gaussian model. The partition 
functions are then calculated for the two systems with 
and without a cavity of volume v present, using a field- 
theoretic method. Chandler then shows that this con- 



8 



tinuous Gaussian fluctuation model leads to the Pratt- 
Chandler integral equation theory. 47 For sufficiently large 
volumes, a Gaussian field model is expected to reproduce 
accurately the p n if the continuous probability densities 
of particle numbers are mapped to discrete values of n 
and the negative values of n are disregarded. 
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Figure 6. Excess chemical potential of hard-sphere 
solutes in SPC water 57 as a function of the exclusion 
radius d. The symbols are simulation results, compared 
with the IT prediction using the flat default model (solid 
line). 50 

III. Results for elementary hydrophobic effects 

a. Hydration of simple spherical solutes 

The probabilities p n of observing exactly n oxygen cen- 
ters inside spherical volumes, shown in Figure 2, are re- 
produced accurately using IT with a flat default model. 
The corresponding excess chemical potentials of hydra- 
tion of those solutes, /i cx = — ksTlnpo, are shown in Fig- 
ure 6. 50 As expected, /z ex increases with increasing cavity 
radius. The agreement between IT predictions and com- 
puter simulation results is excellent over the whole range 
d < 0.36 nm accessible to direct simulation calculations 
of p Q . 
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Figure 7. PMF between two methane-size cavities in 
SPC water, 57 comparing explicit computer simulations 81 
(dashed line) with the IT prediction 50 (solid line). Note 
that the simulation curve is based on continuous solute- 
solvent interactions, subtracting the methane-methane 
interaction potential from the methane-methane PMF. 



b. Methane-methane potential of mean force 
(PMF) 

We now focus on the simplest model of hy- 
drophobic interactions exemplified by the free en- 
ergy of bringing together two methane-size cavities in 
water. 47 - 49 ' 80 - 81 ' 82 ' 83 ' 84 ' 85 This free energy profile de- 
fines a PMF. We obtain the cavity contribution to the 
methane-methane PMF by calculating the excess chem- 
ical potential, /x ex (r), of a cavity volume described by 
two spheres each of radius d and separated by a dis- 
tance r. The PMF is then defined as W(r) = fj, cx (r) — 
lim^oo fi cx (s). This definition guarantees that W(r) — > 
for r — > oo, avoiding the problem of non-additivity dis- 
cussed above for non-stratified volumes. 

Figure 7 shows a comparison of the cavity PMF cal- 
culated using IT with that obtained from explicit simu- 
lations by Smith and Haymet. 81 The flat default model 
has been used here and throughout this paper, unless 
otherwise noted. Also, an exclusion radius of d = 0.33 
nm has been used for methane-size cavities correspond- 
ing to a distance where the methane- water oxygen radial 
distribution function reaches a value of 1.0 in commonly 
used models. 86,87 The direct interaction between the two 
methane molecules has been subtracted from the simula- 
tion PMF to get approximately the cavity contribution 
to the methane-methane PMF. 
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Figure 8. Water contribution to the torsional PMF 
of butane from IT 50 and explicit computer simulation 88 
(dashed line). 



In agreement with the simulation data, the PMF calcu- 
lated using IT shows features characteristic of hydropho- 
bic interactions. A free energy minimum is observed cor- 
responding to overlapping hard spheres. That deep min- 
imum is separated by a barrier from a second minimum 
at a distance of about 0.7 nm. The second minimum 
corresponds to solvent-separated pairs of methane-size 
cavities. In addition, we observe a third shallow min- 
imum which was also seen in computer simulations by 
Liidemann et al. 85 The equilibrium between the solvent- 
separated and contact minimum as a function of pressure 
provides insights into pressure denaturation of proteins, 
as described in section [v| 
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c. Solvent contributions to the torsional equilib- 
rium of butane 

As a final example of a simple hydrophobic effect, Fig- 
ure 8 shows the cavity contribution to the torsional PMF 
for n-butane in water. IT calculations are compared with 
explicit computer simulations by Beglov and Roux. 88 We 
find that the the cavity PMF favors the compact cis 
(4> = 0) structure over the extended trans structure by 
about 1.8 kJ/mol. The gauche structure (0 = 7r/3) is 
favored over trans by about 0.7 kJ/mol. Those differ- 
ences and the overall cavity PMF for the torsional iso- 
merization of butane are in excellent agreement with the 
simulation data. 
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Figure 9. Chemical potential of hard-sphere so- 
lutes with exclusion radii corresponding to neon, argon, 
methane, and xenon as a function of temperature along 
the saturation curve of water. 14 The solid line is the IT 
prediction. The symbols are computer simulation data 
from insertion. Both simulation and theory data are 
based on SPC water 57 data. 



IV. Temperature dependence of hydrophobic hy- 
dration 

a. Solvation chemical potentials 

The hydrophobic effect is often associated with charac- 
teristic temperature dependences. 11 ' 89 One of the more 
astonishing observations is that the entropies of trans- 
fer of nonpolar molecules from gas phase or a nonpolar 
solvent into water converge at a temperature of about 
400 K to approximately zero entropy change. Similar 
behavior was also seen in microcalorimetry experiments 
on unfolding of several globular proteins. 6 ' 7,8 This anal- 
ogy to the transfer data supported the hydrophobic-core 
model of protein folding: during unfolding predominantly 
nonpolar residues are transferred from a mostly nonpolar 
protein core into an aqueous environment. 

We have used the IT model to clarify the molecular ori- 
gin and the quantitative details of this striking entropy- 
convergence behavior. 14 Computer simulations were car- 
ried out to calculate the water-oxygen radial distribution 
function g(r) at several temperatures along the experi- 
mental saturation curve of water. The <7(r)'s together 



with the corresponding water densities are used as input 
in the IT model to calculate chemical potentials of hydra- 
tion for hard solutes. Solute sizes of d — 0.28, 0.31, 0.33, 
and 0.345 nm were considered, corresponding approxi- 
mately to neon, argon, methane, and xenon, respectively. 

Figure 9 shows the chemical potentials for each of the 
spherical solutes as a function of temperature T, for T be- 
tween 275 and 625 K. Also shown are the results of direct 
simulation calculations using test-particle insertion to- 
gether with representative statistical errors. We find ex- 
cellent agreement between simulation and theory for the 
chemical potentials over the whole temperature range. 
The chemical potentials are approximately parabolic ex- 
cept at the highest temperatures, and the curves for dif- 
ferent solute sizes are shifted vertically with the maxi- 
mum at about 400 K. 
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Figure 10. Hydration entropy of hard-sphere so- 
lutes with exclusion radii corresponding to neon, argon, 
methane, and krypton as a function of temperature along 
the saturation curve of water. 14 Shown is the entropy de- 
fined in eq |3l| . 

We obtain a solvation entropy by taking the derivative 
of the chemical potential along the saturation curve, 




(31) 



Additional equation-of-state contributions to the stan- 
dard solvation entropy are negligible, estimated to be less 
than 1 and 10 J/(mol K) for temperatures T < 450 and 
T < 550 K, respectively. Figure 10 shows the temper- 
ature dependence of the entropy S^at for the different 
solutes. The entropies are large and negative at room 
temperature for all the solutes and decrease in magni- 
tude with increasing temperature. The temperature de- 
pendence of entropies is approximately linear with slopes 
increasing with the increasing solute size. The resulting 
heat capacity, 



T 



fdS st 



\ dT 



(32) 



is large and positive (approximately 40 cal mol -1 K _1 for 
a methane-size cavity). Moreover, the entropies converge 
at about 400 K to approximately zero entropy, although 
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at closer inspection the temperature range of the conver- 
gence region is several 10 K and the entropy is not exactly 
zero at convergence. To understand the origin of this ap- 
proximate convergence we determine in the following the 
factors entering into the IT model. 



H ex = Tp 2 (T)x(v) 




H e * = Tp 2 (T)x(v) + Ty(v) 




Figure 11. Schematic representation of the different 
factors resulting in approximate convergence of solvation 
entropies. 
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Figure 12. Comparison of the isothermal compressibil- 
ity of water and nonpolar solvents as a function of tem- 
perature along their respective saturation curves. Data 
are taken from Ref. 90. 



b. Origin of entropy convergence in hydrophobic 
hydration and protein folding 

To quantify the different factors contributing to the 
entropy convergence, 1 we use the approximate Gaussian 
representation eq ^0] for po with mean m = (n) = pv and 
variance a 2 — (n 2 ) — (n) 2 . This results in an estimate of 
the chemical potential, 

^ cx » Tp 2 {k B v 2 /2a 2 } + T{k B ln(27rcr 2 )/2} . (33) 

The second term is smaller than the first, and depends 
only logarithmically on the solute size. The solvation 
chemical potential may therefore be lowered by lower- 
ing the product of temperature and density Tp 2 (T), or 
by increasing the particle- number fluctuations a 2 . The 
temperature dependence of Tp 2 (T) along the saturation 
curve is non-monotonic. On the other hand, we find that 
the variance a 2 (T, v) changes only little over the range of 
temperatures in Figure 9. 14 Accordingly, we can approx- 
imate the chemical potential as 

f*Tp 2 (T)x(v)+Ty(v) , (34) 

where the functions x(v) = k B v 2 /2a 2 and y(v) = 
fes ln(27T(7 2 )/2 depend only on the solute volume, not on 
the temperature. The contributions of the various terms 
to the entropy convergence are illustrated schematically 
in Figure 11. The term Ty(v) in eq^ is generally small. 
If we neglect Ty(v), the solvation entropies eq [5l] con- 
verge exactly at zero entropy for solutes of different size, 
and the temperature of convergence corresponds to the 
maximum of Tp 2 (T) along the saturation curve. When 
the second term Ty(v) is included, the entropy at conver- 
gence is different from zero, but the convergence is still 
exact. However, both x(v) and y(v) show weak temper- 
ature dependences. When those are considered as well, 
the resulting entropy convergence is no longer exact but 
occurs over a temperature range that is several 10 K wide 
for the solutes considered here. 

Our analysis shows that the main factors leading to 
an entropy convergence are derived from the proper- 
ties of pure water. They are: (1) the non-monotonic 
behavior of Tp 2 (T) along the saturation curve and (2) 
the weak temperature dependence of the particle-number 
fluctuations <r 2 (T,v) for solute excluded volumes. The 
macroscopic analog of a 2 is the isothermal compressibil- 
ity xt- For macroscopic volumes v, eq ^3] relates the 
two, a 2 = p 2 vk B TxT- Figure 12 compares the isother- 
mal compressibility of water and several organic solvents 
up to their critical point as a function of temperature. 
Indeed, we find that water shows far weaker tempera- 
ture variations of \t than the organic solvents, which 
can be seen as the origin of the entropy convergence. In 
the macroscopic limit, the variance a 2 — p 2 vk B TxT is 
a product of Tp 2 (T) with a maximum at about 440 K 
and xt with a minimum at about 320 K. This results in 
only small variations of a 2 in the range of temperatures 
between 273 and 420 K. 
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Figure 13. Pressure dependence of the PMF between 
two methane-like solutes in water. The PMF's were 
calculated from information theory using a flat default 
model. 15 A methane-methane Lennard- Jones interaction 
was added. SPC water 57 g(r)'s were used. The arrows 
indicate changes with increasing pressures from —160 to 
7250 bar. 

V. Pressure denaturation of proteins 

a. Denaturation by water transfer into the pro- 
tein interior 

Proteins can be denatured by pressures of typically 
a few kilobar. 19,20 ' 21 ' 22 ' 23 ' 24 ' 25 ' 26 This pressure-induced 
unfolding has been a long-standing puzzle in our under- 
standing of protein stability. The "liquid-hydrocarbon 
model" of protein unfolding 11 explains unfolding as a 
transfer of predominantly hydrophobic residues from the 
protein interior into the aqueous solvent. Indeed, this 
model accurately describes the thermodynamics of tem- 
perature denaturation, in particular, the convergence of 
the entropies of unfolding. However, as Kauzmann 13 
pointed out, such a description fails to explain the pres- 
sure denaturation of proteins. In particular, the volume 
changes associated with transferring hydrocarbons from 
a nonpolar phase into water exhibit behavior exactly op- 
posite to that upon pressure unfolding as a function of 
pressure. The volume change of hydrocarbon transfer is 
negative at low pressure and positive at high pressure; 91 
the volume change of protein unfolding is positive at low 
pressure and negative at high pressure. 13 

To explain the pressure denaturation of proteins, we 
invoke additional information about the structural prop- 
erties of pressure-denatured proteins. Nuclear mag- 
netic resonance (NMR) measurements reveal consider- 
ably more structural organization in pressure-denatured 
proteins compared to that in temperature-denatured 
proteins. 27 These and other experiments 21 ' 22 ' 23 ' 24 indi- 
cate that the pressure-denatured proteins are relatively 
more compact than temperature-denatured proteins. Ac- 
cordingly, we depart from the commonly used model ex- 
plaining protein unfolding thermodynamics as a transfer 
of nonpolar residues from the protein interior into so- 
lution. Instead, we describe pressure denaturation as a 
transfer of water molecules from the aqueous phase into 
the hydrophobic core of a protein. 



To quantify this inverted "liquid-hydrocarbon model," 
we focus on the stability of hydrophobic aggregates 
formed in water as a function of pressure. Aggregate 
formation of spherical, methane-size solutes serves as an 
idealized model of the stability of protein hydrophobic 
cores. The free energy of forming such aggregates can be 
expressed in terms of two-body and higher order PMF's 
between the methane-size solutes in water. In the follow- 
ing, we will study the pair and three-body interactions of 
methane-like solutes in water which control the stability 
and formation of larger aggregates. 

b. Effect of pressure on hydrophobic interactions 

Figure 13 shows the PMF's W(r) between two 
methane-like particles in water for pressures up to about 
7 kilobar (700 MPa) calculated using IT. 15 The PMF's 
exhibit two minima, a contact minimum at about 0.4 
nm distance and a solvent-separated minimum at a dis- 
tance of about 0.7 nm. The two minima are separated 
by the desolvation barrier. The PMF's are normalized 
at the solvent-separated minimum to illustrate that the 
contact minimum loses importance with increasing pres- 
sure. The free energy of the contact minimum increases 
relative to the solvent-separated minimum by about 0.9 
kJ/mol when the pressure is increased from 1 bar to 7 
kilobar. In addition, the desolvation barrier between the 
two minima also increases with increasing pressure. Sim- 
ilar pressure destabilization of contact configurations was 
observed for three methane molecules in contact. 15 

The effect of pressure on pair PMF's between two 
methane molecules has also been studied by computer 
simulation. 92 These simulation calculations showed a 
pressure destabilization of contact pairs that is com- 
parable in magnitude to the IT calculations. In addi- 
tion, simulation calculations also showed the formation 
of methane aggregates in water at low pressures that dis- 
solved at high pressures. 93 

The pressure destabilization of hydrophobic contact 
configurations can be understood from the following sim- 
plified model: At low pressures, the interstitial space be- 
tween two large nonpolar solutes is energetically unfavor- 
able for water molecules. 55,94 As the pressure increases, 
the space between the nonpolar solutes is more likely 
to be occupied by water, increasing the importance of 
solvent-separated configurations relative to contact con- 
figurations. A more detailed analysis indeed shows that 
the destabilizing effect of pressure on hydrophobic con- 
tacts increases with increasing size of the two nonpolar 
solutes. 15 

c. Stability of proteins 

We have shown that the stability of hydrophobic aggre- 
gates decreases with increasing pressure as a result of wa- 
ter penetration. When applied to the hydrophobic-core 
model of proteins, we predict that pressure denatures 
proteins by swelling. 15 A swelling mechanism has also 
been proposed for urea-induced protein denaturation. 95 
Under sufficiently high pressures water molecules will 
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intercalate into the hydrophobic protein interior. The 
resulting structures of pressure-denatured proteins arc 
predicted to be more ordered and more compact than 
those of temperature-denatured proteins. Recent small- 
angle scattering experiments 28 indeed showed that the 
pressure-denatured staphylococcal nuclease protein has 
a considerably smaller radius of gyration of R g « 3.5 
nm compared to the temperature-denatured protein with 
R g « 4.6 nm. 

d. Formation of clathrate hydrates 

This physical picture of pressure denaturation of pro- 
teins is relevant also for the formation of clathrate hy- 
drates, crystalline compounds forming at elevated pres- 
sures from non-polar molecules and water. As solid de- 
posits in gas pipelines, clathrate hydrates cause problems 
in natural gas transmission. Methane- water clathrate hy- 
drates, for instance, form at pressures above about 44 
MPa at a temperature of 298 K, with a stoichiometry of 
about 6-7 water molecules per methane and a methane- 
methane nearest-neighbor distance of about 0.62-0.74 
nm. 96 At lower pressures, the two fluid phases sepa- 
rate. Formation of the clathrate structure is consistent 
with stabilizing the solvent-separated minimum in the 
methane-methane PMF relative to the contact minimum, 
as predicted from the analysis of pressure effects on hy- 
drophobic interactions. In addition, the present theory 
predicts that larger hydrocarbons require lower pressure 
for clathrate formation, 15 in agreement with the experi- 
mental observations that led to the gas-gravity method 
of phase determination. 96 
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Figure 14. Pressure dependence of the desolvation bar- 
rier between the contact and solvent-separated minimum 
in the methane-methane PMF of Figure 13. The activa- 
tion free energy for forming (open squares) and breaking 
hydrophobic contacts (filled squares) is shown as a func- 
tion of pressure. 

e. Pressure effects on protein folding kinetics 

We noted before that increasing pressure results in 
a higher desolvation barrier between the contact con- 
figuration and the solvent-separated minimum. Fig- 
ure 14 shows the barrier heights AWj and AW"* from the 
solvent-separated and contact minimum, respectively, as 



a function of pressure. Thus, AWj and AW* arc the 
barriers for forming and breaking hydrophobic contact 
configurations, corresponding to "folding" and "unfold- 
ing" reactions, respectively. We find that AWj and 
AW* both increase approximately linearly with increas- 
ing pressure. Activation volumes defined as Av^ , = 
dAWjy u /dp are both positive, Avj = 3.8 ml/mol and 

AvJ = 1.6 ml/mol, for folding and unfolding reactions, 
respectively. Increasing pressure is thus expected to slow 
down both the "folding" and the "unfolding" reactions. 

Indeed, experimental studies of the pressure-dependent 
folding and unfolding kinetics of the protein staphylococ- 
cal nuclease showed a reduction of both rates with in- 
creasing pressure. 97 The experimentally observed activa- 
tion volumes for folding and unfolding of staphylococcal 
nuclease are AVj — 92 ml/mol and AV$ — 20 ml/mol, 
respectively. From the ratio of overall experimental ac- 
tivation volumes and theoretical activation volumes cal- 
culated per hydrophobic contact, we can estimate the 
number of hydrophobic contacts broken in the folding 
transition state. For staphylococcal nuclease we estimate 
that number to be between 10 and 25, in good agreement 
with the predictions of energy landscape theory. 15,98 
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Figure 15. Excess chemical potential of hard-sphere 
solutes in aqueous NaCl solution as a function of solute 
size, given by the exclusion radius with water, and salt 
concentration. 

VI. Salt effects on hydrophobic hydration 

Addition of salts to water generally reduces the solu- 
bility of nonpolar solutes. The Hofmeistcr scries 29 ranks 
salts, in part, according to this reduction of the solubility 
of nonpolar compounds. Here, we do not attempt to pro- 
vide a complete description of salt effects on hydrophobic 
hydration. Instead, we present some preliminary results 
for the solubility of small hard-sphere solutes in a solu- 
tion of sodium-chloride (NaCl) in water. In particular, 
we compare results for NaCl concentrations of 1, 3, and 
5 mol/1 as well as pure water. The three salt solutions 
were studied using molecular dynamics simulations ex- 
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tending over 1 nanosecond each. For simulation details, 
see Rcf. 99. 

Figure 15 shows the excess chemical potential of spher- 
ical solutes as a function of their exclusion radius d with 
water. The solute-solvent interactions are modeled as 
those of hard spheres with radii i? w = 0.132 nm of wa- 
ter, i?Na = 0.085 nm of Na + , and Rci = 0.18 nm of 
Cl~. The exclusion radii with water, Na + , and Cl _ arc 
d = R + i? w , R + i?Na, and R + Rci, respectively, where 
R is the solute hard-sphere radius. We find that adding 
salt indeed increases p cyi and thus decreases the solubil- 
ity for a solute of a given size. For a methane-size solute 
(d = 0.33 nm), the excess chemical potential increases 
by about 10 kJ/mol from about 34 to 44 kJ/mol when 
increasing the NaCl concentration from to 5 mol/1. 

To describe the salt dependence using the IT approach, 
we extend our formulation to solvent mixtures. For 
ionic solutes, the strong interactions between water and 
ions give rise to the formation of compact hydrated ions 
which result in non-trivial density fluctuations near ions. 
Therefore, instead of trying to model the full distribu- 
tion of particle number fluctuations p(nw, «Na, nci), we 
focus on a more tractable subset, which contains the 
probability of the event we are ultimately interested in, 
p(uw = 0,nNa = 0, nci = 0). Further, we factorize this 
probability into an ionic part and a conditional water 
part, 



p{n w = 0, n Na = 0, nci = 0) = p(n Na 
xp(n w = 0|n Na = 0, nci = 0) , 



= 0,nci = 0) 



(35) 



where p(nw = 0|nN a = 0, nci = 0) is the conditional 
probability of having no water-oxygen atoms in the wa- 
ter exclusion volume, given that there are no ions in 
the respective ion exclusion volumes. Such a factor- 
ization combined with accurate simulation data pro- 
vides insights into salt effects on hydrophobic solubili- 
ties at a molecular level. The contributions to the so- 
lute chemical potential, — kBTliip(n^i a = 0, nci = 0) 
and — kftTh\p(nw = 0|riNa = 0,nci = 0) both increase 
with increasing NaCl concentration. The increase in the 
salt contribution is primarily due to an increase in the 
mean of the p(n^ a , nci) distribution. Interestingly, the 
mean of p(nw|nNa = 0, nci = 0) is approximately inde- 
pendent of salt concentration. It is the decrease in the 
variance of this conditional water distribution with in- 
creasing NaCl concentration that leads to a decrease in 
f)(uw = 0|nNa = 0, nci = 0). The direct salt effect of the 
reduction in free volume due to overlap with salt ions and 
the indirect salt effect of changing the water structure ac- 
count for roughly one half each of the total increase in 
solute chemical potentials. 

VII. New directions: Cavity expulsion and area 
laws 



We mentioned in section lie that for macroscopic so- 



the solvation chemical potential is largely determined by 
the free energy of forming a vapor bubble of appropriate 
size. The free energy of forming large bubbles can be 
approximated by the product of surface area and liquid- 
vapor surface tension, since the contribution from the 
pressure- volume work is small for water. 38 Accordingly, 
we expect that the solvation chemical potential grows 
with the surface area for large solutes. Stillinger 38 gave 
a justification of this based on the water structure near 
a large hard solute. The contact-value theorem gives, in 
the limit of a flat hard wall, a water density at contact of 
about 7.4 x 10 -4 times the bulk density of water. Still- 
inger argued that this "dewetting" implies the presence of 
a molecularly thick vapor layer near a hard wall, and as a 
result, area laws with surface tension coefficients similar 
to that of the liquid-vapor surface tension of water. 

In an attempt to bridge the gap between microscopic 
and macroscopic solvation behavior, we have recently 
developed a quantitative description of the solvation of 
nonpolar solutes as a function of their size. 55 The wa- 
ter structure near spherical 55 and non-spherical Lennard- 
Jones solutes 100 ' 101 was found to be largely insensitive to 
the details of the solute- water interactions except for the 
solute size, motivating the use of perturbation theory. 
For spherical solutes, we find that the radial distribution 
function of water around the solute can be described ac- 
curately by 



9sw(r) = exp[-/3u rcp (r) - f3u>{r) + C(r)] 



(36) 



lutes physical effects intrude that are not captured in the 
simplest IT models. This is a consequence of the mul- 
tiphasic nature of water. In the limit of large solutes, 



where u rep (r) is the repulsive solute- water interac- 
tion from, e.g., Weeks- Chandler- Anderson separation; 102 
C(r) is a renormalized attractive interaction; and u)(r) 
represents the free energy of a test water molecule that 
does not interact with the solute as a function of the dis- 
tance r from the solute, 55 thus containing the non-trivial 
solvent contributions. Interestingly, u)(r) is directly re- 
lated to the one-body Lagrange multiplier u;W(r) of the 
continuous IT eq El[ We find that ui(r) is dominated by a 
contribution u>o(r— ro) that is identical for various solutes 
except for a simple radial translation with the solute size 
ro. The remainder Acj(r — ro), however, changes non- 
trivially with the solute size, acting as a cavity-expulsion 
potential for the test water molecule. 

The cavity-expulsion potential, Aw(r — ro), quantifies 
primarily the loss of energetic interactions of a test wa- 
ter molecule as it crosses the solute- water interface from 
the water phase towards the center of the solute. For 
small solutes, Aw(r — ro) is negligible. As the solute 
size increases, this loss of interactions becomes impor- 
tant, reaching —fJ^ } the negative excess chemical poten- 
tial of water, in the limit of large solutes. For solutes 
of diameters more than about 0.5 nm the cavity expul- 
sion results in a slight depression in the water density 
at the solute-water interface. We find that this "weak 
dewetting" with increasing solute size is sufficient to give 
an approximate surface-area dependence of the solvation 
chemical potential, which would otherwise be dominated 
by solute volume terms. 55 
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VIII. Concluding remarks 

How is water different from hydrocarbon liquids as a 
solvent for nonpolar solutes? This is the foremost ques- 
tion asked of theories of hydrophobic effects and several 
of the points above constitute relevant parts of the an- 
swer. It is appropriate therefore to ask this question more 
specifically and to organize features of the answer that 
are provided by the IT models considered here. One ba- 
sic aspect of the answer is straightforward and on that 
score we do not contribute anything further: water is dif- 
ferent because of the mechanical potential energies of the 
interactions among water molecules, including hydrogen 
bonding, orientational specificity, and cooperativity. 

In the context of solvation thermodynamics, the ques- 
tion how water differs from other solvents solicits infor- 
mation about a particular structural pattern of the sol- 
vent that might be principally responsible for hydropho- 
bic hydration free energies. For example, clathrate-like 
structures have been invoked to explain hydrophobic hy- 
dration phenomena. 103 The answer given by the IT model 
is that water differs from common hydrocarbon solvents, 
in the first place, because the distribution of pairs of ex- 
clusion spheres, oxygen-oxygen sites in water, is distinc- 
tive and has a distinctive temperature dependence. In 
the second place, the isothermal compressibility of water, 
that can be obtained by integration of the oxygen-oxygen 
distribution, is insensitive to temperature compared to 
the same property of organic solvents. Since the mini- 
mum in the temperature dependence of the isothermal 
compressibility is very distinctive, this answer is remark- 
able and remarkably simple. 

From our analysis of pressure effects on hydrophobic 
interactions and protein stability, we can deduce fur- 
ther peculiarities of water. Increasing the pressure in 
water shifts the balance from attractive and directional 
hydrogen-bonding forces to repulsive and isotropic pack- 
ing forces between water molecules. 104 This results in 
"energetic frustration" of water molecules in the water 
phase, 105 which in turn reduces the relative cost of incor- 
porating water molecules into a hydrophobic aggregate. 
At least two factors make water special in this context: 
its large negative excess chemical potential prevents wa- 
ter from penetrating hydrophobic aggregates under nor- 
mal conditions, 55 thus driving the formation of protein 
hydrophobic cores. Under sufficiently high pressure, how- 
ever, the relatively small water molecules occupy pre- 
viously empty cavities, exploiting imperfections in the 
packing of the protein interior. Note that the small size 
of water molecules here is not used as an argument that 
"interstitial" cavities are smaller in water than in typical 
organic liquids, 76:77 as discussed in section [If, but that 
water molecules can occupy small cavities in the protein 
interior. This incorporation of water molecules is favored 
further by the versatility of water molecules which can 
form favorable hydrogen-bonding interactions in many 
different environments, particularly in the interior of a 
protein which is never entirely nonpolar. 

These arguments also explain in part the success of 
simple theories, such as SPT and PC, that take no spe- 



cial account of the known details of hydrophobic solva- 
tion patterning. These theories yield satisfactory predic- 
tions for the simplest hydrophobic effects in part because 
they incorporate information about the equation of state 
and the oxygen-atom pair correlations of water. A next 
stage of basic molecular theory of hydrophobic effects 
will surely address the hydration structure in more detail 
while at the same time preserving equation of state infor- 
mation such as the low, temperature insensitive values of 
the isothermal compressibility that have been identified 
as particularly important. 

IT models of the solvation thermodynamics can eas- 
ily be generalized to various aqueous and non-aqueous 
solvents and mixtures, combined with other approaches, 
or extended to "unusual" models of water, such as a re- 
cently proposed two dimensional model of water 106 or an 
isotropic water model without directional hydrogen-bond 
interactions. 83 We have successfully adapted the method 
to study the solubility of small molecules in polymeric 
fluids. 107 ' 108 Crooks and Chandler 109 compared solva- 
tion chemical potentials of hard spheres of varying sizes 
in hard-sphere fluids to simulation data, finding that a 
Gibbs prior gives a good description of the density fluctu- 
ations and the solvation thermodynamics. Applications 
of the IT method to study phase equilibria have been 
suggested. 110 Lum, Chandler, and Weeks 111 have com- 
bined field-theoretic methods 52 with the IT approach to 
describe the solvation thermodynamics and structure of 
mesoscopic and macroscopic spherical solutes in water 
which is tightly associated with "dewetting" of the so- 
lute surface. 38,55 

The development of IT models is an ongoing effort tar- 
geting several directions. A better description for molec- 
ularly large solutes can be expected from improved, phys- 
ically motivated default models that incorporate the mul- 
tiphasic nature of water to account for the free energy 
of forming microscopic vapor bubbles. 111 IT models can 
easily be generalized to study hydrophobic interactions 
in inhomogeneous environments, for inst ance, to describe 
ligand binding to proteins. Section [I c gives an outline 
of a continuous IT, which establishes connections to the 
concepts of cavity expulsion and dewetting of nonpolar 
surfaces. 55 The volume stratification discussed in section 



II d leads naturally to predictions of the hydration struc- 



ture. 

In conclusion, we believe that the basic IT model pro- 
vides a simple theoretical framework to study many hy- 
drophobic phenomena, as it already led to a new under- 
standing of the temperature dependence of hydrophobic 
hydration, 14 and resulted in a new description of the pres- 
sure denaturation of proteins. 15 
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